Visual Objects Classification with Sliding Spatial Pyramid Matching

Authors

  • Hao Wooi Lim
  • Yong Haur Tay
Abstract

We present a method for visual object classification that uses only a single feature, transformed color SIFT [15], with a variant of Spatial Pyramid Matching (SPM) that we call Sliding Spatial Pyramid Matching (SSPM), trained with an ensemble of linear regression models (provided by LINEAR) to obtain a state-of-the-art result of 83.46% on Caltech-101 [22]. SSPM is a special version of SPM in which, instead of dividing an image into K regions, a subwindow of fixed size is slid across the image with a fixed step size. For each subwindow, a histogram of visual words is generated. To obtain the visual vocabulary, instead of performing K-means clustering [26], we randomly pick N exemplars from the training set and encode them with a soft non-linear mapping method from [16]. We then train 15 models with linear regression, each with a different visual word size. All 15 models are then averaged to form a single strong model.
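The sliding-subwindow pooling and the model averaging described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of NumPy, and the hard assignment of each descriptor to a single visual word are assumptions made for brevity (the abstract describes a soft non-linear mapping from [16] instead).

```python
import numpy as np


def sliding_window_histograms(word_map, n_words, window, step):
    """Concatenate visual-word histograms taken from a sliding subwindow.

    word_map: 2-D integer array in which each entry is the visual-word
              index of the local descriptor at that grid position
              (hard assignment is an assumption of this sketch).
    n_words:  size of the visual vocabulary.
    window:   side length of the square subwindow, in grid cells.
    step:     fixed step size between consecutive subwindows.
    """
    height, width = word_map.shape
    histograms = []
    for top in range(0, height - window + 1, step):
        for left in range(0, width - window + 1, step):
            patch = word_map[top:top + window, left:left + window]
            counts = np.bincount(patch.ravel(), minlength=n_words)
            # Normalize so every subwindow contributes equal total mass.
            histograms.append(counts / counts.sum())
    return np.concatenate(histograms)


def average_ensemble(score_matrices):
    """Average per-class scores from several models, then pick a class.

    score_matrices: sequence of arrays, each of shape (n_samples, n_classes).
    """
    mean_scores = np.mean(score_matrices, axis=0)
    return mean_scores.argmax(axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    word_map = rng.integers(0, 8, size=(6, 6))  # toy 6x6 grid, 8 words
    features = sliding_window_histograms(word_map, n_words=8, window=4, step=2)
    print(features.shape)  # 4 subwindows x 8 words
```

Unlike the fixed grid of standard SPM, the subwindows here may overlap whenever the step is smaller than the window, so the feature length depends on the window and step sizes instead of on a fixed number of regions K.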


Similar resources

Adaptive Deep Pyramid Matching for Remote Sensing Scene Classification

Convolutional neural networks (CNNs) have attracted increasing attention in the remote sensing community. Most CNNs only take the last fully-connected layers as features for the classification of remotely sensed images, discarding the other convolutional layer features which may also be helpful for classification purposes. In this paper, we propose a new adaptive deep pyramid matching (ADPM) mo...


Detection and Classification of Multiple Objects using an RGB-D Sensor and Linear Spatial Pyramid Matching

This paper presents a complete system for multiple object detection and classification in a 3D scene using an RGB-D sensor such as the Microsoft Kinect sensor. Successful multiple object detection and classification are crucial features in many 3D computer vision applications. The main goal is making machines see and understand objects like humans do. To this goal, the new RGB-D sensors can be ...


Improved Spatial Pyramid Matching for Image Classification

Spatial analysis of salient feature points has been shown to be promising in image analysis and classification. In the past, spatial pyramid matching made use of both salient feature points and spatial multiresolution blocks to match between images. However, it has been shown that different images or blocks can still have similar features under spatial pyramid matching. The analysis and matching ...


Image Classification Using Sparse Coding and Spatial Pyramid Matching

Recently, the Support Vector Machine (SVM) using the Spatial Pyramid Matching (SPM) kernel has achieved remarkable success in image classification. The classification accuracy can be improved further by combining sparse coding with SPM. However, the existing methods give the same weight to patches of SPM at different levels. Clearly the discriminative powers of SPM at different levels are ...


Application of sparse coding with spatial pyramid matching for face expression classification

In this work I evaluate the performance of an image classification method based on spatial pyramid matching for sparse codes, using the JAFFE database of facial expressions. I show that the method is comparable to other methods typically used for such datasets, and I also describe some attempts that I made to improve the algorithm.



Journal title:
  • CoRR

Volume abs/1212.3767

Publication date 2012